Development of Story Text-to-speech System Based on Story Genres

نویسندگان

  • Parakrant Sarkar
  • K. Sreenivasa Rao
  • Gurunath Reddy
چکیده

Storytelling is a distinct speaking style which embraces various expressive subtleties aimed to draw the attention of the children. Studies have shown that Text-to-speech (TTS) systems have the tendency of not conveying the right emotion expressivity in their speech outputs. The objective of this work is to develop prosody models to capture the story semantics present in the Hindi children stories. In this work, we have considered three prosodic parameters to be modeled: duration, intonation, and intensity of the syllables. We have proposed textual (positional, contextual and phonological) and storysemantic features for modeling the prosody. The proposed prosody models will be integrated with the TTS framework to synthesize speech in storytelling style. We used Classification and Regression Tree (CART), Feed-forward neural network (FFNN), and Support Vector Machine (SVM) for modeling the prosody. We pose the problem of modeling the prosody as a data driven statistical transformation from input text onto the feature space to capture the implicit prosodic and semantic knowledge of the syllables. The evaluation of the models was carried out using objective and subjective measures. A framework is designed to further improve the performance of the prosody models by including the story genre specific prosody modeling. The three genres of the story considered are Folk-tale, Legendary, and Fable. The subjective evaluation of the synthesized story speech shows the proposed framework is capable of improving the storytelling style.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phonological Mean Length of Utterance in 48-60-Month-old Persian-speaking Children with Isfahani Accent: Comparison of Story Generation and Conversation Samples

Objective:Phonological Mean Length of Utterance (PMLU), a quantitative measure for assessment of phonological skills, has been considered in developmental studies as a diagnostic and clinical criterion in phonological development. Moreover, it is an indicator rate of the efficacy of the intervention. The PMLU is a word level measure that can be calculated on the child’s transcribed speech sampl...

متن کامل

Architecture Narration: A Comparative Study on Narration in Architecture and Story

The way architects think about different issues from developing plans, perspectives, and views to cross-sections and structure of a building is a common and general one. Regardless of its merits and efficiency, this way of thinking indicates a degradation in architectural thinking. Indeed, architectures today are caught in a specific architecture language where the boundaries of language create...

متن کامل

Analysis of the image of to tie up Zahak in Tahmasbi and Rashida Shahnameh based on Gerard Genets narratology approach

The drawing of Zahak is one of the subjects that have been the focus of painters.  In many Shahnamehs, including the Shahnameh of Tahmasabi and Rashida, this issue is depicted in a different way. The purpose is to compare the pictures with the text and with each other. Gerard Genet is one of the theorists who has researched in the field of narratology, he is one of the structuralists and has of...

متن کامل

Analysis and Comparative of Grimas Actor Model in Pictures and Story of Sheikh Sanan

Attention to the narrative structure of stories from the twentieth century followed the science of linguistics and then semiotics as a branch of science. The narration of Sheikh Sanan is the most famous and longest story of Attar, which has been analyzed many times in terms of literature, narration and artistic since its creation. In the field of narration, Grimas proposed a comprehensive rule ...

متن کامل

The Study of “the World as an Image” in the Narrative of “The story of Siavash” Using Genette’s Theory

Narrative process and its narrative mechanisms help the reader make sense of the way events happen in a story. Using repeating images in the text of a story is a method of narrative development.  In Shahnameh, dealing with the world and images it gives rise to is one of the central motives of the text. The narrator in different parts of the poem seems captivated by the image of the world and th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016